Improving zero-shot retrieval using dense external expansion
نویسندگان
چکیده
Pseudo-relevance feedback (PRF) is a classical technique to improve search engine retrieval effectiveness, by closing the vocabulary gap between users’ query formulations and relevant documents. While PRF typically applied on same target corpus as final retrieval, in past, external expansion techniques have sometimes been obtain high-quality pseudo-relevant set using . However, such approaches only studied for sparse (BoW) methods, its effectiveness recent dense methods remains under-investigated. Indeed, ANCE ColBERT, which conduct similarity based encoded contextualised document embeddings, are of increasing importance. Moreover, pseudo-relevance mechanisms proposed further enhance effectiveness. In particular, this work, we examine application zero-shot i.e. evaluation corpora without training. Zero-shot experiments with six datasets, including two TREC datasets four BEIR when applying MSMARCO passage collection corpus, indicate that obtaining documents ColBERT can significantly NDCG@10 (by upto 28%) 12%). addition, brings 30% improvements 29% retrieval. • Dense improves performance. High quality be retrieved expansion. Experimental results show significant ColBERT-PRF ANCE-PRF
منابع مشابه
Query Expansion using External Resources for Improving Information Retrieval in the Biomedical Domain
This paper presents the first participation of the ERIAS team in task 3 of the ShARe/CLEF eHealth Evaluation Lab 2014. The goal of this task is to evaluate the effectiveness of Information Retrieval systems to support patients in accessing easily relevant information. We propose a method which exploits external resources for improving information retrieval in the biomedical domain. The proposed...
متن کاملImproving Retrieval Using External Annotations: OHSU at ImageCLEF 2010
Over the past several years, our team has focused its efforts on improving retrieval precision performance by mixing visual and textual information. This year, we chose to explore ways in which we could use external data to enrich our retrieval system’s data set; specifically, we annotated each image in the test collection with a set of MeSH headings from two different sources: human-assigned M...
متن کاملImproving Query Expansion for Information Retrieval Using Wikipedia
Query expansion (QE) is one of the key technologies to improve retrieval efficiency. Many studies on query expansion with relationships from single local corpus suffer from two problems resulting in low retrieval performance: term relationships are limited and unlisted query terms have no expansion terms. To address these problems, relationships between terms captured from Wikipedia are superim...
متن کاملImage-Mediated Learning for Zero-Shot Cross-Lingual Document Retrieval
We propose an image-mediated learning approach for cross-lingual document retrieval where no or only a few parallel corpora are available. Using the images in image-text documents of each language as the hub, we derive a common semantic subspace bridging two languages by means of generalized canonical correlation analysis. For the purpose of evaluation, we create and release a new document data...
متن کاملDARLA: Improving Zero-Shot Transfer in Reinforcement Learning
S in Fig 2. The setup was a seek-avoid style task, where one of the two object types in the room gave a reward of +1 and the other gave a reward of -1. The agent was allowed to pick up objects for 60 seconds after which the episode would terminate and a new one would begin; if the agent was able to pick up all the ‘good’ objects in less than 60 seconds, a new episode was begun immediately. The ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Information Processing and Management
سال: 2022
ISSN: ['0306-4573', '1873-5371']
DOI: https://doi.org/10.1016/j.ipm.2022.103026